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1. Introduction 

Q 
C/3 \ Let C{x±, ..., x n } denote the ring of power series whose coefficients increase slowly 

enough so that the series converges in a neighborhood of the origin in C n . Suppose 

f(x, y) G C{x, y} with /(0, 0) = 0. Then one version of Puiseux's theorem is the statement 

that there exists a factorization 
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/(»,!/) = w(z,y)x c JJ(y-0<(a;)) (1-1) 
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Here for some natural number n, each gi G C{x~ } with ^(0) = 0, and u(x, y) G C{x~ , y« } 
with w(0, 0) 7^ 0. Hence the zeroes of f(x,y) are parameterized by analytic functions of 
one variable. (With a little more effort one can show u(x,y) G C{x,y}). One method 

p to prove the factorization (1.1) goes back to Isaac Newton himself. Newton's method 

produces the terms of the g^ (x) through an infinite recursion; in modern treatments one 
then shows the resulting power series converges in a neighborhood of the origin. The 
latter is normally done by invoking a topological argument involving Riemann surfaces 



(see [BK]). Alternatively, one may carefully examine the properties of Newton's algorithm 
as one proceeds and then directly prove that the resulting g%(x) are in some C{x~}; this 
is done in [Ca] and [Ch] (Puiseux's original proof was somewhat different). 

r^ The purpose of this paper is to provide an argument based on Newton's method 

and some ideas from resolution of singularities that gives a quick proof of the factorization 
(1.1) (including the convergence of the gi(x)). It is then shown that similar ideas can be 
used to give a short proof of the existence of smooth adapted coordinates in two dimensions 
(Theorem 1.2 below). This result was first proved in the real-analytic case by Varchenko 
[V] and then recently for the general smooth case by Ikromov-Muller [IM]. These proofs 
use detailed information about the zero set of S(x, y). 

The arguments of this paper will use only the two-dimensional implicit function 
theorem and some basic properties of Newton polygons; they are however inspired by 
more modern resolution of singularities ideas as will be discussed at the end of section 2. 
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It should also be pointed out that if one is willing to assume the Weierstrass preparation 
theorem and Hensel's lemma, there exist short and rather different elementary proofs of 
Puiseux's theorem of a more algebraic nature. We refer to [N] for more information. 

We make extensive use of the following object, essentially used in Newton's letter. 

Definition 1.2. Let Yl a b s abX a y b be a power series in x~ and y~ for some positive integer 
n, and assume that at least one s a b is nonzero. For any (a, b) for which s a b 7^ 0, let Q a b 
be the quadrant {(x,y) G R 2 : x > a,y > b}. Then the Newton polygon N(S) is defined 
to be the convex hull of all Q a b- 

The boundary of a Newton polygon consists of finitely many (possibly zero) 
bounded edges of negative slope as well as an unbounded vertical ray and an unbounded 
horizontal ray. We also will make use the following. 

Definition 1.3. Let S(x,y) be as in Definition 1.2. The Newton distance d(S) is defined 
by d(S) = inf{d : (d, d) G N(S)}. 

Definition 1.4. Suppose e is a compact edge of N(S). Define S e (x,y) by S e (x,y) = 
Xl( a b)ee s a,bX a y b ■ In other words S e (x, y) is the sum of the terms of the Taylor expansion 
of S over all (a, b) G e. 

In the study of oscillatory integrals in two dimensions, the notion of adapted 
coordinates plays an important role. 

Definition 1.5. A coordinate system is said to be adapted if d(S) = sup Q d(S o a), where 
the supremum is taken over all smooth coordinate changes a defined in a neighborhood of 
(0,0) such that a(0,0) = (0,0). 

The significance of adapted coordinate systems is the following. Consider the 
oscillatory integral 

J X = f e lXS ^ y U(x,y)dxdy (1.2) 

J-R? 

Assume S(0, 0) = and that S has a critical point at the origin; that is, V5'(0, 0) = 0. The 
function (f>(x, y) is a cutoff function in a neighborhood of the origin and A denotes a real 
parameter which one assumes to be large. Then the best (supremal) e for which one has 
the estimate \J\\ < Cln(2 + |A|)|A| _e for all A and all (f)(x,y) supported in a sufficiently 
small neighborhood of the origin has the nice form e = -^tW if and only if S(x, y) is in an 
adapted coordinate system. This was proved by Ikromov, Kempe, and Miiller in [IKM]. 
For the real-analytic case, where one considers real-analytic phase and takes the supremum 
of (1.2) over real-analytic coordinate changes, the corresponding result was earlier proven 
by Varchenko in [V]. 

Theorem 1.1. Suppose S has nonvanishing Taylor expansion at (0,0) and S"(0,0) = 0. 



A coordinate system is adapted if any of the following three cases hold. 

Case 1. The line y = x intersects N(S) in the interior of a bounded edge e and any real 
zero r of S e (l, y) or S e (— 1, y) with r^O has order less than d(S). 

Case 2. The line y = x intersects N(S) at a vertex (d, d). 

Case 3. The line y = x intersects N(S) in the interior of one of the unbounded edges. 

Proof: By the main theorem of [G], if U is a small enough neighborhood of the origin, 
and eo denotes the supremum of the numbers e for which J v \f\~ e is finite, then d(S) < j-, 
with d(S) = — in cases 1, 2, and 3. Hence if one is in cases 1, 2, or 3, one is in adapted 
coordinates. 

Theorem 1.2. Suppose S has nonvanishing Taylor expansion at (0,0) and S^O, 0) = 0. 
Then there exists some coordinate system in a neighborhood of the origin such that one 
of Cases 1, 2, or 3 hold. Hence there exists an adapted coordinate system for S(x, y). The 
associated coordinate change can always be taken to be of the form (x, y) — ► (x, y — ?p(x)) 
or (x, y) — ► [x — V'(y)) y) f° r a smooth ifj. 

In [IM], a slightly weaker version of Theorem 1.2 is proven which also shows that 
for any smooth phase there exists a smooth adapted coordinate system. The arguments of 
[IM] use Puiseux's theorem and do a careful analysis of the different gi(x). The proof of 
Theorem 1.2 is in section 3. It should be also pointed out that Theorem 1.1 follows from 
Theorem 3.3 of [IM]. 

2. Proof of Puiseux's Theorem. 

Suppose f(x,y) G C{x,y}. After factoring out the largest possible power of x 
out of f(x,y), we can write f(x,y) = x c g(x,y), where 9^(0,0) 7^ for some e. Since 
(1.1) trivially holds if e is zero, we can assume e > 0. Assuming e to be chosen minimal, 
we have that (0, e) is on the Newton polygon N(g). We will prove Puiseux's theorem by 
proving the following theorem: 

Theorem 2.1. Suppose that h(x,y) = J2 a b h a bX a y b G C{x^,y} such that h(0,0) = 
and such that (0, E) G N(h) for some E > 0. Then one has a factorization h(x, y) = 
H(x,y)(y — g(x)) where for some natural number N, H(x,y) G C{x^ ,y} and g G C{x^} 
with 0(0) = 0. 

Puiseux's theorem follows by applying Theorem 2.1 repeatedly; (0, E—l) G N(H) 
and thus starting with g(x,y), after e iterations one has (1.1). 

Proof of Theorem 2.1. If y divides h(x,y) we are done, so we may assume that there 
is some point (D, 0) on the Newton polygon N(h) with D > 0. Let (p, q) denote the 



vertex of N(h) with q > such that q is minimal. Thus the segment e connecting (p, q) to 
(D, 0) is an edge of N(h). Let h e (x, y) denote the sum of the terms h a bX a y b of the series 
h(x,y) = J2 a i,h a } ) x a y b such that (a, b) is on e. Thus h e (x,y) is a polynomial in x~ and 
y. Write the equation of the edge e as x + my = a. Hence if h a \,x a y h appears in h e (x,y) 
then a + mb = a. We factor out x a , writing h e (x, y) = x a h' e (x, y). Each term of h' e (x, y) 
is now of the form h a bX a ~ a y h and (a — a) + mb = or (a — a) = —mb. Thus we have 

WV = h ab (^) b (2.1) 

x m 

Conequently for a polynomial P(z), we can write 

h e {x,y) = x a P{^-) (2.2) 

The proof of Theorem 2.1 will now proceed by an inductive process. At each stage we will 
perform a coordinate change of the form (x,y) — ► (x, y + a(x)) for some a(x) G C{x"~} 
with a(0) = 0. The resulting function h(x, y + a(x)) will fall into one of the following two 
(not mutually exclusive) cases. 

Case 1: y divides h(x,y + a(x)). 

Case 2: h(x, y+a(x)) satisfies the hypotheses of Theorem 2.1 and the second-lowest vertex 
(p" , q") of the Newton polygon of h(x, y + a(x)) satisfies q" < q. 

In the first case, one transfers back to the original coordinates and we have the 
conclusions of Theorem 2.1. In the second case, one is back under the assumptions of 
Theorem 2.1 and thus can repeat the upcoming argument, finding the next a(x). Since 
q" < q, after at most q iterations one will have to be in the first case and we will be done. 

So our task is to show that under the assumptions of Theorem 2.1 we can always 
find an a(x) such that one of the two cases holds. Suppose first that the polynomial P has a 
(complex) root r of order q' < q. Then the function P(z + r) has a root at z = of order q'. 
Hence there is a term of h e (x,y + rx m ) = x a P(-^ + r) with y appearing to the g'th power, 
but no terms with y appearing to a lower power than q' . Define H(x, y) = h(x, y + rx m ). 
Note that H e (x, y) = h e (x, y + rx m ). Thus a segment of the line x + my = a is an edge of 
the Newton polygon N(H) of H, as was the case for h. However, instead of going down 
to (D,0), for H the segment terminates at {p',q') for some p' . Hence either (p',q') is the 
lowest vertex of N(H), in which case one is in Case 1 with a(x) = rx m , or the second- 
lowest vertex (p",q") of N(H ) (which could be (p',q')) satisfies q" < q' < q. Therefore if 
we let a(x) = rx m , h(x,y) is in case 2 and we are done. 

Thus it remains to analyze the situation where P(z) has a single complex root r 
of order q. Here P(z) = c(z — r) q for some c. This is the situation where Newton's method 
gives an infinite iteration; here we will do something different. We look at the function 
h(x,x rn y). Since x + my = a is a supporting line for N(h), the terms of h e (x,x m y) are 



the terms of h(x,x m y) with with the lowest power of x appearing. Since h e (x,x rn y) = 

m 

x a P(^-^-) = cx a (y — r) 9 , for some e > we may write 

h(x, x m y) = cx a {y - r) q + x a+e l(x, y) (2.3) 

By (2.3), the function h'(x,y) = a is in C{x^',y} for some N and we have 

ti(x,y) = c(y-r) q + x e l(x,y) (2.4) 

The trick is now as follows. The function d q J\ has a zero at (0, r), but has non- vanishing 
y derivative there. Hence by applying the 2-dimensional implicit function theorem (tech- 
nically to Q q J\ (x N , y)) : one has that there is some function k(x) G C{x"~ } with fc(0) = r 

such that d d q J\ (x, k(x)) = near the origin. One now defines H(x, y) = h(x : y + x m k(x)). 
The fact that allows the induction to proceed is that 

aq—lzr f)1~ 1 h f) q ~ 1 h' 

W^^ = dy^^ Xmk{x)) = X m -dy^^ Hx)) ^° (2 - 5) 

Like before, the coordinate change is such that x + my = a is still a supporting line for 
N(H). This time it intersects N(H) in the single vertex (p, q). This may be easiest to see 
from (2.3) using the fact that in the coordinates of (2.3) the coordinate change is of the 
form (x, y) — > (x,y + r + k(x)) where k(0) = 0. 

If y divides H we are back in case 1 and we are done. So we may assume there is 
some vertex (d', 0) on N(H) with d' > 0. If (p, q) is anything other than the second-lowest 
vertex, we are in case 2 and thus we'd be done again. Hence we can assume that the 
segment e' connecting (p, q) to (d' , 0) is an edge of N(H). The condition (2.5) ensures that 
H e r (x, y) cannot have a single complex root of order q; for if this were to happen like above 
H e '(x,y) would be of the form cx a (— ^r — r') q . But this expression has a nonvanishing 
y q ~ 1 term; this contradicts (2.5) which implies that for every a the Taylor series coefficient 
H aq -i is zero. Hence H e >(x,y) must have a root of order less than q. We dealt with this 
situation above; a further coordinate change of the correct form puts us in case 1 or 2. 
This completes the proof of Theorem 2.1. 

Those familiar with resolution of singularities algorithms can recognize this idea 
of taking the zero set of the (q — l)st derivative of a function and making it a hyperplane, 
so that an inductive procedure may continue. So essentially what is happening here is 
that an argument of this type is being incorporated into Newton's method to construct a 
process that terminates after finitely many applications of the implicit function theorem 
rather than an infinite iteration. 

3. Proof of Theorem 1.2. 

We now assume that S(x, y) is a smooth function defined in a neighborhood of 
the origin with 5(0,0) = and having a nonvanishing Taylor expansion at (0,0). Let 



N(S) denote the Newton polygon of this Taylor expansion. Assume we are not in any 
of the three cases of Theorem 1.2. Thus the line y = x intersects the Newton polygon 
N(S) in the interior of a compact edge e, and S e (l, y) or S e (—1, y) has a real zero r ^0 
of order k > d(S). Replacing x by — x and/or y by — y if necessary, we may assume r > 
is a zero of S e (l,y) of order k. The goal is to do a coordinate change of the proper form 
that puts us into one of these three cases. Denote the equation of the line containing e by 
x + my = a. Exactly as (2.2), there is some polynomial Q(y) such that for x > we have 

S e (x,y) = x a Q(±) 

Plugging in x = 1, we see that Q(y) = S e (l,y). Hence S e (x,y) has zeroes of order k on 
the curve y = rx m . This implies that S e (x, 1) has a zero of order k at x = r~~ . As a 
result, we may switch the roles of the x and y axes if necessary and assume that m > 1; 
this makes our subsequent arguments somewhat easier. 

Next, we show that m must in fact be an integer. To see this, note that if m 
were not an integer, then the degrees of the powers of y appearing in S e (l,y) would have 
to be separated by at least 2. Hence S e (l, y) would have to be of the form yPR(y c ) for 
some (3 > 0, c > 2, where R is a polynomial. Next, since (d(S), d(S)) is on N(S), we have 
a = (1 + m)d(S). Since m > 1 when m > 1 is not an integer, the maximum possible value 
of y on the line x + ray = (l + m)d(S) for x,y > is m ^-d(S) < 2d(S). Thus the degree of 

yPR(y c ) is less than 2d(S) : and hence the degree of R(y) is less than — — < d(S). Hence 
the zeroes of R(y) are of order less than d(S), implying the zeroes of of S e (l, y) = y ,3 R(y c ) 
other than y = are of order less than d(S). This contradicts our assumption that S e (l,y) 
has a zero of order k > d(S) and we conclude that m is an integer. 

Note that if m is even, then S e (l,y) = ±S e (—l,y), while if m is odd one has 
S e (l,y) = ±S e (—l,—y). Hence both S e (l,y) and S e (—l,y) have a zero of order k; we 
never really had to replace x by — x in the above. The preceding arguments thus imply: 

Fact: If one is not in adapted coordinates and m > 1, then m is an integer and Q(y) = 
S e (l,y) has a zero of order k > d(S). 

We now come to the main argument; we will prove the existence of a coordinate 
change of the form (x, y) — ► (x, y + a(x)), a(x) smooth, that puts us into one of the three 
cases. (The coordinate change (x,y) — » (x + a(y),y) corresponds to m < 1). We proceed 
as follows. Let (p,q) denote the upper vertex of the edge e; necessarily q > d(S). We will 
find a smooth function a(x) such that S'(x, y) = S(x, y + a(x)) is in one of the following 
two (not mutually exclusive) categories. 

Category 1: S'(x, y) is in one of the three cases of adapted coordinates. 

Category 2: The line y = x intersects the interior of an edge e' of N(S') with equation 
x + va'y = a', m' > 1, such that the upper vertex (p', q') of e' satisfies q' < q. 



Theorem 1.2 will then follow; there can be at most q iterations of category 2. 

The arguments now resemble those of section 2. We first consider the case where 
the order k of the zero r of Q(y) = S e (l, y) satisfies k < q. Here we let a(x) = rx m , and 
thus S'(x, y) = S(x, y + rx m ). Then x + my = a is a supporting line of N(S') as it was for 
N(S), and like in section 2 there is an edge E of N(S') on this line whose upper vertex is 
(p,q). Note that S' E (x,y) = S e (x,y + rx m ) = x a Q(-p^ + r). Since Q has a zero of order 
k at r, the lowest power of y appearing in S' E (x,y) is y k , and therefore £"s lower vertex 
is at a point (j, k) for some j. Since both vertices of E have ^/-coordinates at least d(S), 
they are both in the portion of x + my = a on or above (d(S),d(S)). Thus the edge E 
lies wholly on or above the line y = x. If the line y = x intersects N(S') at a vertex or 
inside the horizontal or vertical rays, one is in Category 1. Otherwise, it must intersect 
N(S') in the interior of an edge e' whose upper vertex is either (j, k) or a lower vertex. 
And because x + my = a is a supporting line for N(S') and e! lies below E, e' will have 
equation x + m'y = a' for some m' > m > 1. Thus we are in category 2. Hence when 
k < q we are in either Category 1 or 2 and we are done. 

It remains to consider the situation where r is a zero of Q(y) of order q. In this 
case we have Q(y) = c(y — r) q for some c. For a large integer n we expand S(x,y) as 

S(x, y) = cx a (^- r) q + T n (x, y) + E n (x, y) (3.1) 

Here the polynomial T n (x,y) are the terms of <S"s Taylor expansion with exponents less 
than n. For all < (3, 7 < n one has 

r^+ 7 E 

Analogous to (2.3), one has 

S(x, x m y) = cx a (y - r) q + x a+1 T' n {x, y) + E n (x, x m y) (3.3) 

Here T^(x, y) is also a polynomial. Analogous to (2.4) we define s(x, y) = {x,x a y> , so that 

s(x, y) = c{y - r) q + xT' n (x, y) + x~ a E n (x, x m y) (3.4) 

We claim that the function s(x,y) is smooth on a neighborhood of (0,r). Off the y-axis 
smoothness holds because S(x, y) is smooth. One can show that a given derivative of 
s(x,y) exists when x = and equals that of c(y — r) q + xT^(x,y) for large enough n 
by examining the difference quotient of a one-lower order derivative of (3.4), inductively 
assuming this lower-order derivative exists and has the right value when x = 0. Equation 
(3.2) ensures that the difference quotient of the lower derivative of x~ a E n (x,x rn y) tends 
to zero as x goes to zero. We conclude that s(x,y) is smooth on a neighborhood of (0,r). 



Analogous to after (2.4), we next use the smooth implicit function theorem on 
g g _f and find a smooth function k(x) defined in a neighborhood of x = such that 
k(0) = r and | g _f (x, fe(x)) = 0. Transferring this back to S(x, y), as in (2.5) we have 

d q ~ 1 S 

-^ 1=T (x,x m k(x)) = (3.5) 

Thus if we let a(x) = x m k(x) and S'(x,y) = S(x,y + x rn k(x)), for all x we consequently 
have 

^__(*,0) = (3.6) 

Thus for every a the Taylor series coefficient S' aq _ 1 is zero. 

Next, since x + my = a is a supporting line for N(S), analogous to after (2.5) 
this line is also a supporting line for N(S') and intersects N(S') at the single vertex (p, g). 
If S"(x, y) is in adapted coordinates we are in Category 1 and have nothing to prove, so we 
may assume the coordinates are not adapted. Let e' denote the edge of N(S') intersecting 
the line y = x and denote its equation by x + m'y = a'. If the upper vertex (p',q') of 
N(S') satisfies q' < q, one is in Category 2 and we are done. So we assume this upper 
vertex is (p, q) itself. Also, since e' lies within the set x + my > a and is no higher than 
the vertex (p, q) of N(S') that is on the supporting line x + my = a, we have m' > m > 1. 

If S' e ,(l,y) has a real zero r' ^ of order less than q, one is in the situation 
above (3.1); there is a smooth b(x) such that S'(x,y + b(x)) = S(x,y + a(x) + b(x)) is in 
Category 1 or 2 as needed. The only other possibility is that S' e ,(l,y) has a single zero 
r' 7^ of order q. But like at the end of section 2, this cannot happen. For this would 
imply S' e ,(x,y) = c'x a (— ^r — r') q has a nonvanishing y q ~ x term. Consequently, for some 
a the Taylor series coefficient S' aq _ 1 would be nonzero, contradicting (3.6). Thus the case 
where S' e ,(l, y) has a single zero of order q does not occur, and we are done. 
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